The DoF Region of the Multiple- Antenna 
Time Correlated Interference Channel 



We consider the time-correlated multiple-antenna interference channel where the transmitters have 

(i) delayed channel state information (CSI) obtained from a latency-prone feedback channel as well as 

(ii) imperfect current CSIT, obtained e.g. from prediction on the basis of these past channel samples. 
We derive the degrees of freedom (DoF) region for the two-user multiple-antenna interference channel 
under such conditions. The proposed DoF achieving scheme exploits a particular combination of the 
space-time alignment protocol designed for fully outdated CSIT feedback channels (initially developed 
for the broadcast channel by Maddah-Ali et al, later extended to the interference channel by Vaze et al. 
and Ghasemi et al.) together with the use of simple zero-forcing (ZF) precoders. The essential ingredient 
lies in the quantization and feedback of the residual interference left after the application of the initial 
imperfect ZF precoder. Our focus is on the MISO setting albeit extensions to certain MIMO cases are 
also considered. 



Although the determination of capacity region of the interference channel (IC) has been a long standing 
open problem, several interesting recent results shed light on the problem from various perspectives. 
Among these we may cite the capacity region obtained for special cases [1-4], or obtained for general 
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channel classes in both scalar and MIMO settings up to approximations with bounded gaps [5, 6]. When 
specializing to the large SNR regime, it is known that the characterization of the full capacity region 
can be conveniently replaced with the determination of the so-called degree-of-freedom (DoF) region. 
Progress on that particular front was reported in [7] with the derivation of the DoF region for the two- 
user MIMO interference channel with Mi, M 2 transmit antennas and iVi, N 2 receive antennas, where the 
sum DoF min{Mi +M 2 ,N 1 + N 2 , max(Mi, N 2 ), max(M 2 , iVi)} is shown to be optimal. Most of these 
advances suggest achievable schemes which require the full knowledge of channel state information (CSI) 
at both the transmitter and receiver sides. In fact, the cruciality of CSI at the transmitter side in particular 
is demonstrated in such works as [8-10] where the DoF region is shown to shrink dramatically when 
zero CSIT is available. The intermediate scenario of limited or incomplete CSIT was also considered 
in [11, 12]. In [11], the rate of limited feedback needed to preserve the DoF optimality in interference 
alignment-enabled IC is provided. More recently, the impact of feedback delays providing the transmitter 
with outdated CSI over MIMO channels was considered in [13] for the broadcast channel (BC) and 
later extended to the IC [14, 15]. The key contribution in [13] was to establish the usefulness of even 
completely outdated channel state information in designing precoders achieving significantly better DoF 
than what is obtained without any CSIT. Considering the worst case scenarios, including those where 
the feedback delay extends beyond the coherence period of the time varying fading channels, the authors 
in [13] propose a space-time interference alignment-inspired strategy achieving an optimal sum DoF 
of 4/3 for the two-user MISO BC, in a setting when the no CSIT case yields no more than 1 DoF. 
The essential ingredient for the proposed scheme in [13] lies in the use of multi-slot protocol initiating 
with the transmission of unprecoded information symbols to the user terminals, followed by the analog 
forwarding of the interference overheard in the first time slot. 

Recently, this strategy was generalized under similar principle to the interference channel setting [14, 
15], again establishing DoF strictly beyond the ones obtained without CSIT in scenarios where the delayed 
CSIT bears no correlation with the current channel realization. 

Albeit inspiring and fascinating in nature, such results nonetheless rely on the somewhat over-pessimistic 
assumption that no estimate for the current channel realization is available to the transmitter. Owing to the 
finite Doppler spread behavior of fading channels, it is however the case in many real life situations that 
the past channel realizations can provide information about the current one. Therefore a scenario where 
the transmitter is endowed with delayed CSI in addition to some (albeit imperfect) estimate of the current 
channel is practical relevance. This form of combined delayed and imperfect current CSIT was recently 
introduced in [16] for the multiple-antenna broadcast channel whereby a novel transmission scheme is 
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proposed which extends beyond the MAT algorithm in allowing the exploitation of precoders designed 
based on the current CSIT estimate. The full characterization of the optimal DoF for the hybrid CSIT 
was reported in [17] and independently in [18]. The key idea behind the schemes in [16, 17] lies in the 
modification of the MAT protocol where i) the initial time slot involves transmission of precoded symbols, 
followed by the forwarding of residual interference overheard in the first time slot, and ii) the taking 
advantage of the reduced power for the residual interference (compared with full power interference in 
MAT) based on a suitable quantization method and digital transmission. 

In this paper, we extend the results in [16, 17] and consider the two-user time-correlated multiple- 
antenna interference channel. A similar hybrid CSIT scenario is considered whereby each transmitter has 
access to delayed channel samples for the links it is connected to, as well as possessing an imperfect 
estimate of the current channel. The current CSIT estimate could be obtained from, e.g., a linear prediction 
applied to past samples [19,20], although the prediction aspects are not specified in this paper. Instead, 
the quality level for the current CSIT estimate is simply modeled in terms of an exponent of the transmit 
power level, allowing DoF characterization for various ranges of current CSIT quality. Thus our model 
bridges between previously reported CSIT scenarios such as the pure delayed CSIT of [13-15] and the 
pure instantaneous CSIT scenarios. We assume each receiver has access to its own perfect instantaneous 
CSI and the perfect delayed CSI of other receivers (as in e.g. [13-15]), in addition to the imperfect 
current CSI. 

In what follows we obtain the following key results: 

• We establish an outer bound on the DoF region for the two-user temporally-correlated MISO 
interference channel with perfect delayed and imperfect current CSIT, as a function of the current 
CSIT quality exponent. This result is initially derived for the two-antenna transmitters and then 
generalized. 

• We propose two schemes which achieve the key vertices of the outer bound with perfect delayed 
and imperfect current CSIT. The schemes build on the principles of time-slotted protocol, starting 
with the ZF precoded transmission of information symbols from the two interfering transmitters 
simultaneously and followed by forwarding of the residual interferences. As in the BC case, the 
residual interference reflects on the quality of the initial precoder and can be shown to be quantized 
and power scaled in a suitable way to achieve the optimal DoF. 

• Our results coincide with previously reported DoF results for the perfect CSIT setting (current CSIT 
of perfect quality) and pure delayed CSIT setting (current CSIT of zero quality). 

• The DoF region of certain MIMO cases is also provided as a function of the current CSIT quality 
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exponent and the number of receive antennas. 
Notation: Matrices and vectors are represented as uppercase and lowercase letters, and matrix transport, 
Hermitian transport, inverse and determinant are denoted by A 1 , A", A^ 1 and det(A), respectively, h^- 
is the normalized orthogonal component of any nonzero vector h. The approximation f(P) ~ g{P) is 
in the sense of lirnp^oo = C, where C is a constant that does not scale as P. A >z means 
A is symmetric positive semidefinite if A is square and A < B means B — A is symmetric positive 
semidefinite if both A and B are squared matrices. 

II. System Model 

We consider a two-user MISO interference channel, where two transmitters each equipped with 2 
antennas 1 wish to send two private messages to their respective receivers each with a single antenna, as 
shown in Fig. 1. The discrete time baseband signal model is given by 



y(t) = h H u (t) Xl (t) + h H 12 {t)x 2 (t) + e(t) 
z{t) = h H 2l {t) Xl {t) + h H 22 {t)x 2 {t) + b{t), 



(la) 
(lb) 



for any time instant t, where hji(t) € C is the channel vector from Tx-i to Rx-j; e(t),b(t) ~ Nc (0, 1) 
are normalized additive white Gaussian noise (AWGN) at the respective receivers; the coded input signal 
Xi(t) is subject to the power constraint E(||a:j(i)|| 2 ) < P, VI 



Delay 
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Fig. 1: The two-user MISO interference channel. 



'The generalization to arbitrary number of antennas is considered in Section VI. 
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Assumption 1 (mutually independent fading). At any given time instant t, the channel vectors {hji(t)} 
are mutually independent and identically distributed (i.i.d.) with zero mean and covariance matrix 1 2 . 

Assumption 2 (perfect delayed local CSIT and imperfect current local CSIT). At each time instant t, Tx-i 
knows perfectly the delayed local CSIT {hu(k), h 2 i(k), k = 1, . . . , t— 1} (with which link it is respectively 
connected), and somehow predict/estimate imperfectly the current local CSIT {hu(t), h 2 i(t)}, which can 
be modeled by 

hji(t) = hji(t) + hji(t) (2) 

where the estimate hji(t) and estimation error hji(t) are independent and assumed to be zero-mean and 
with variance (1 — a 2 )I 2 , cr 2 I 2 , respectively (0 < a 2 < 1). 

Assumption 3 (perfect delayed CSIR, imperfect current CSIR and perfect current local CSIR). At each 
time instant t, Rx-i knows perfectly the delayed CSIR up to instant t — 1 for all links, i.e., {H(k)} t l z} 1 , 
where 

H(k) 4 {h u (k),h 12 (k),h 21 (k),h 22 (k)}, (3) 

and the imperfect current CSIR (similarly modeled as at the transmitters) up to instant t for all links, 
i.e., {H(k)Y k=l , where 

H(k)±{h u (k),h 12 (k),h 21 (k),h 22 (k)}, (4) 
as well as the perfect current local CSIR, i.e., {hn(t),hi 2 (t)}. 

We assume that the estimation error a 2 can be parameterized as an exponential function of the power 
P, so that we hope to characterize the DoF of the MISO IC with respect to this exponent. To this end, 
we introduce a parameter a > 0, such that 

a ,. logo- 2 
a = - hm -. (5) 

P^oo log-P 

This q indicates the quality of current CSIT at high SNR. While a = reflects the case with no current 
CSIT, ol — > 00 corresponds to that with perfect instantaneous CSIT As a matter of fact, when a > 1, 
the quality of the imperfect current CSIT is sufficient to avoid the DoF loss, and ZF precoding with this 
imperfect CSIT is able to achieve the maximum DoF [20]. Therefore, we focus on the case a £ [0, 1] 
hereafter. The connections between the above model and the linear prediction over existing time-correlated 
channel models with prescribed user mobility are highlighted in [16]. 
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According to the definition of the estimated current CSIT, we have 

EWMhfM 2 ) = Edh^mfm 2 ) +H\h" ji mfM 2 ) ( 6 ) 

= E(\~h" ji (t)h ji (t)\) (7) 

= o 2 (8) 

~ P~ a (9) 
where E(||/ij.(t)|| 2 ) = 1, and (9) is obtained from (5). 

III. The Degree of Freedom Region 

A rate pair (i?i, R 2 ) is said to be achievable for the two-user interference channel with perfect delayed 
CSIT and imperfect current CSIT if there exists a (2 nRl , 2 ni?2 , n) code scheme consists of: 

• two message sets [1 : 2 nRl ] at the Tx-1 and [1 : 2 nR2 } at the Tx-2, from which two independent 
messages Mi and M2 intended respectively to the Rx-1 and Rx-2 are uniformly chosen; 

• one encoding function at the Tx-i: 

Xi(t) = ft {M h {h lt (k)Y k - = \, {h 2i (k)}l- = \, {h u (k)}l =1 , {h 2i (k)}{ =1 ) ; (10) 

• and one decoding function at its corresponding receiver, e.g., 

Mj = 9i ({»(t)}w 1 W)}w,{ff(t)}li-'«iiW.^W) (ID 
for the Rx-1 when j = 1, and it is similarly defined for the Rx-2 by replacing y(t) with z(t), 

in) 

such that the average decoding error probability P e , defined as 

P e (B) =e(p((Mi,M 2 ) + (Mi,M 2 ))) , (12) 

vanishes as the code length n tends to infinity. The capacity region C is defined as the set of all achievable 
rate pairs. Accordingly, the DoF region can be defined as follows: 

Definition 1 (the degree-of-freedom region). The degree-of-freedom (DoF) region for two-user MISO 
interference channel is defined as 

2> = \ {di,d 2 ) eR 2 + \V( Wl ,w 2 ) eR 2 + ,w 1 d 1 +w 2 d2 <limsup [ sup W ^+^ R A I (13) 

Consequently, the DoF region for the two-user time-correlated MISO interference channel is stated in 
the following theorem. 
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Theorem 1. In the two-user MISO interference channel with perfect delayed CSIT and imperfect current 
CSIT (as stated in Assumption 2), the optimal DoF region can be characterized by 



di < 1 

d 2 < 1 
2di + d 2 < 2 + a 
d x + 2d 2 <2 + a. 



(14a) 
(14b) 
(14c) 
(14d) 



Remark: Interestingly, the above DoF region is identical to that of the two-user MISO broadcast 
channel with perfect delayed CSIT and imperfect current CSIT [17, 18]. In fact, this result is consistent 
with previous results on the pure delayed CSIT case (a = 0) where it was shown that the DoF region 
for the two-user BC and the two-user IC coincides, and also on the special case of perfect instantaneous 
CSIT (a = 1). 

For illustration, the DoF region for the two-user MISO IC is provided in Fig. 2. The DoF regions with 
no CSIT, pure perfect delayed CSIT, and perfect instantaneous CSIT are also plotted for comparison. 
It shows that the DoF region with perfect delayed CSIT and imperfect current CSIT is strictly larger 
than that with pure delayed CSIT and quickly approaches the region with perfect CSIT as the quality of 
current CSIT increases. 
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Fig. 2: DoF region for the two-user MISO interference channel (when a = 0.5). 
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Given an a, the DoF region is a polygon whose vertices are: (0,1), (a, 1), (^4p,^4p), (l,a) and 
(1,0). In the following, we first characterize the outer bound, and then propose two schemes to show 
they are achievable, and in turn the entire region can be achieved by time sharing. 

IV. Outer Bound 

We adopt a strategy reminisced in [14] to obtain the genie-aided outer bound, by assuming that (i) 
both receivers know the CSI H(t) perfectly and instantaneously as well as the imperfect current CSI 
H(t) at time t, and (ii) the Rx-2 has the instantaneous knowledge of the Tx-l's received signal y{t). 

Define 

y'(t)±h" 12 (t)x 2 (t) + e(t) (15) 
z'(t)±h" 22 (t)x 2 (t) + b(t) (16) 
7±{H(t),H(t)}? =1 (17) 

W) 4 {{y\i)}UAz'mtl{Hii)}UAH(i)}i=i} (18) 

where 7 denotes the channel state information and its estimated version available at the receivers from 
the beginning up to time instant n. 
To ease our presentation, we denote: 

ne n = 1 + ni?P e (n) (19) 

(n) 

where e n tends to zero as n — > oo by the assumption that linin^oo P e = 0. Then, we can upper-bound 
the achievable rate of Rx-1 by applying Fano's inequality: 

nRi (20) 

<I(M i; {y(t)}? =1 \7)+ne n (21) 

= I(M 1 ,M 2 ;{y(t)}lU\'J) - I(M 2 ;{y(t)}l l =1 \M 1 ,7) +ne n (22) 

< nlogP - 7(M 2 ; {y(t)}™ =1 \Mi,7) + n ■ 0(1) + ne n (23) 
= n log P - h({y(t)}? =1 |Mi, 7) + h({y(t)}? =1 \M U M 2 , 7) + n ■ 0(1) + ne n (24) 
= nlogP - /i({t/(t)}^ =1 |Mi,T) + n ■ 0(1) + ne n (25) 
= n log P - h({y'(t)}? =1 \7)+n- 0(1) + ne n (26) 

n 

< nlogP - Hy'(t)\7, {y'mzl M*)}^ 1 ) + n ■ O(l) + ne n (27) 

t=l 

n 

= n\ogP-J2 h(y'(t)\U(t),H{t)) + n ■ 0(1) + ne n (28) 

t=i 
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where (23) follows the fact that the rate of the point-to-point MISO channel (i.e., Tx-1 together 
with Tx-2 are treated as the transmitter by cooperation while Rx-1 as the receiver) is bounded by 
nlogP + n ■ 0(1); (25) is due to the fact that (a) transmitted signals {cCj(t)}™ =1 are determined given 
messages, channel matrices up to n and the encoding functions defined in (10), (b) translation does change 
differential entropy, and (c) noise is independent of the channel matrices, the transmitted signals and 
the messages; (26) is obtained because (a) the transmitted signals {a?i(t)}" =1 are determined provided 
the channel matrices, Mi and the encoding functions according to (10), and (b) translation preserves 
differential entropy; (27) follows the chain rule of differential entropy and the fact that conditioning 
reduces differential entropy; the last equality is obtained due to y'(t) is independent of {H(k)}^ =t+1 
and {HimU+v 

By applying Fano's inequality, we then also upper-bound the achievable rate of Rx-2 as 

nR 2 (29) 

< J(M 2 ; {y(t)}UA<t)}t=i W + ne n (30) 

< J(M 2 ; {y{t)^ =1 ,{z(t)YU^i\7) + ne n (31) 
= J(M 2 ; {y(t)}t=i, |M!, 7) + ne n (32) 
= J(M 2 ; W(t)}t=i, W(.t)Yt=i W + ne n (33) 

n 

= £ K.M 2 ;y\t),z'(t)\7, Wmz{, {z'{i))t\ ) + ne n 
t=i 

n 

<^/(^(t); y '(t),z'(t)|T,{ y '(z)}*- 1 1 ,{z'( i )}^) + n6 n (34) 
t=i 

n 

= £ (h(y'(t),z'(t)\7,{y'(i)}lzl{z'(i)}lZl) 
t=i 

-h(y'(t),zXt)\x 2 (t),7,{y>(i)}izlAz\i)}tl))+ne n (35) 

n 

< Hy\t), z '(t)\7, wmtl wmtl) + ^ (36) 

n 

= 5^ h(y'(t), z'(t)\U(t), H(t)) + ne n (37) 

where (32) is obtained because of the chain rule of mutual information and the independence between 
Mi and M 2 ; (33) is due to (a) the transmitted signals {xi(t)}f =l are determined given message Mi, 
channel matrices and encoding functions, and (b) M 2 and {y' (t) , z' (t)} are independent of Mi; (34) 
is obtained by Markov chain M 2 — >• x 2 {t) — > {y' (t) , z' (t)} and the data processing inequality; (36) is 
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because (a) translation does not change differential entropy, (b) Gaussian noise terms are independent 
from instant to instant, and are also independent of the channel matrices and the transmitted signals, and 
(c) the differential entropy of Gaussian noise is nonnegative; and the last equality is obtained due to the 
independence {y'(t), z'(t)} of {H(k)}% =t+1 and {H {k)} n k=t+1 . 

According to the Markov chain {x 2 (t)}™ =1 — > ({y(t)}f =1 , {z(t)}f =1 ) — > {y(t)}" =1 , we upper-bound 
the weighted sum rate as 



n(2i?i + R 2 ) 



(38) 



< 2nlogP + ]T (h(y'(t),z'(t)\U(t), H(t)) - 2h(y' (t)\U(t) , H(t))) + n ■ O(l) + ne n . (39) 



i=l 



Before preceding further, we introduce the following lemma stated in [17]. 

Lemma 1 ( [17]). For an m x 1 random vector h = h + h where h ~ Nc(0,<7 2 I m ) is independent of 
h, given any K >z with eigenvalues Ai > ■ ■ ■ > A m , we have the following upper and lower bounds: 

log(l + e-V 2 Ai) + 0(l) <E^log(l + h H Kh) < log(l + ||/*|| 2 Ai) + 0(1). (40) 

The difference of the upper and lower bounds can be further bounded by 

log(l + ||/i|| 2 Ai) - log(l + e" 7 a 2 Ai) < - log(a 2 ) + 0(1) (41) 

where 7 is Euler's constant. 



S(t) 
S(t) 



With the definitions 

r h\ 2 {t) 

hut) 

w(t) 4 [e (t) b(t)Y (44) 
K(t)±E{x 2 (t)x H 2 (t)\U(t)} (45) 
we further upper-bound the weighted difference of two conditional differential entropies that derived 



(42) 



(43) 
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above, i.e., 

h(y'(t),z'(t)\U(t), H{t)) - 2h(y'(t)\U(t), H{t)) (46) 

= h(y'(t), z'(t)\U(t), S(t)) - 2h(y'(t)\U(t), S(t)) (47) 

< max max (h(y'(t), z'(t)\U(t), S(t)) - 2h(y'(t)\U(t), S(t))) (48) 

G>;0, p(U(t)), 
b(C)<P p ( X2 (t)\U(t)) 
K(t)<C 

= max max {h(S(t)u(t) + w(t)\U(t), S(t)) - 2h{h\ 2 (t)u{t) + e{t)\U(t), S(t))) (49) 
cyo, p(u(t)) 

tr(C)<P K u {t)<C 

= max max (h(S(t)u(t) + w(t)\S(t), S{t)) - 2h(h" 12 (t)u(t) + e(t)\S{t), S(t))) (50) 

ChO, p(S(t)) 

= max max E s(t) g (t) (logdet(I + S{t)K u (t)S H (t)) - 21og(l + h" 12 (t)K u (t)h 12 (t))) (51) 
tr(Cj<P Kjt)% 

<E S{t) max max E s(t)|i§(t) (Iogdet(I + S(t)K u (t)S"(t)) - 2 log(l + h H 12 (t)K u (t)h 12 (t))) (52) 

tr(C]<P^% 

<E §(t) max max E sft)|d(t) (Iog(l + h" 22 {t)K u {t)h 22 {t))) - Iog(l + h" 12 {t)K u {t)h 12 {t))) (53) 

tr(c)<p^ (t yb 

<E dm max max (log(l + ||h2 2 (t)|| 2 Ai(Jif u (t))) - log (l + e^X^K^t)))) + 0(1) (54) 

^ > C>;0, p(S(t)) V ' 

<alogP + 0(l) (55) 

where in (47) H{t) is replaced by because of the independence of {y'(t), z'(t)}; (49) is ob- 
tained because Gaussian distributed vector u(t) maximizes the weighted difference of two differential 
entropies over all conditional distribution of X2(t) with the same covariance matrix constraint, where 
K u (t) = K{u(t)u H (t)} = m£Lx p m(t)) K(t) [21]; (50) is because u(t), S(t) and w(t) are independent of 
U(t) except S(t); (51) is obtained because u(t) is Gaussian distributed and independent of {H(t)}^ =1 , 
{H(t)}™ =1 as well as the noise terms; (52) follows the fact that putting the expectation out of the 
maximization increases the value; (53) follows from the inequality det(I + A) < YYiLi (1 + a u) where 
A is an m x m positive semidefinite matrix with entry a^ ; the last two inequalities are according to 
Lemma 1 and the quality of current CSIT (a 2 ~ P~ a ). 

Accordingly, we obtain an upper bound of 2R\ + R 2 from (39) and (55), i.e., 

n(2i?i + R 2 ) < n(2 + a) log P + n ■ 0(1) + ne n (56) 

as n — > 00, from which (14c) is obtained according to the definition of DoF. 
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By exchanging the roles of Tx-l/Rx-1 and Tx-2/Rx-2, another inequality (14d) can be similarly obtained 
by assuming Rx-1 has the instantaneous knowledge of z(t), where the weighted rate is bounded as 



Together with the first two bounds (14a) and (14b) which are obtained by the constraint of antenna 
configuration, the DoF region is completely characterized. 



With perfect delayed CSIT, the authors in [14] and [15] characterize the DoF region for two-user MIMO 
interference channel, bridging between the case with no CSIT [8-10] and that with perfect instantaneous 
CSIT [7]. Particularly, for two-user MISO case, the DoF pair (§,§) is achievable with delayed CSIT, 
strictly larger than (\,\) achieved with no CIST and dominated by (1, 1) with perfect CSIT 

The technique exploits the advantage of interference alignment in the time domain by utilizing the 
delayed CSIT together with the space domain, which is referred to as MAT alignment [13-15]. We first 
briefly review its application in the interference channel. 

A. MAT in the Interference Channel 

The MAT alignment in the interference channel is an extension from the broadcast channel, taking 
into account the distributive and uncooperative nature of the transmitters [15]. The two-phase protocol 
which consumes three time slots is described as follows: 

Phase-1: In this phase, each Tx transmits two independent encoded symbols to its intended receiver 
without precoding during a single time slot, i.e., 



n{Ri + 2R 2 ) < n(2 + a) log P + n ■ 0(1) + ne 



(57) 



V. ACHIEVABILITY 



*l(l) 



u 



(58a) 



352(1) 



V 



(58b) 



and the received signals at both receivers are 



y(i) 



/i H n (l> + /i H 12 (l)«+e(l) 



(59a) 




z(l) 



h" 21 (l)u+h" 22 (l)v + b(l), 



(59b) 




where 7/1 and r\ 2 are interference terms overheard at the Tx-1 and Tx-2, respectively. 
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Phase-II: At the end of phase-I, the delayed CSIT {/i2i(l)} is available at the Tx-1, while {^12(1)} 
is accessible at the Tx-2. Together with the transmitted symbols, the overheard interference terms are 
reconstructible at both Txs. By retransmitting the overheard interference terms i] 2 = /i 2 l i(l)' u at th e Tx-1 
and 771 = h*[ 2 {2)v at the Tx-2 with time division, i.e., 



asi(2) = 
x 2 (2) = 



and 



xi(3) = 
as 2 (3) = 







(60a) 
(60b) 

(61a) 
(61b) 



where two entire time slots are consumed, we cancel the interference terms rji and 772 at the Rx-1 and 
Rx-2, and importantly provide another linear combination of u (from 772) and v (from 771) to the Rx-1 and 
Rx-2, respectively. By the end of phase-II, both receivers are able to recover their own symbols with high 
probability. The key idea behind is interference repetition and alignment in both space and time domain. 
At each receiver, the mutual interference aligns in one dimension, while the desired signal spans in a 
two-dimensional space. This enables each receiver to retrieve the desired signal from a three-dimensional 
space. 



B. Integrating the Imperfect Current CSIT 

The MAT alignment takes into account the completely outdated CSIT, regardless of the correlation 
between current and previous channel states. As a matter of fact, such an assumption on the delayed CSIT 
is over-pessimistic, since the current CSI can be predicted from the past states if the underlying channel 
exhibits some temporal correlation. Recent results demonstrate that the DoF region can be enlarged in 
broadcast channel by using estimated current CSIT, even it is imperfect [16, 17]. 

In the following, two schemes are proposed, demonstrating the larger DoF region can be achieved 
by utilizing estimated current CSIT exploited from time correlation model. Instead of forwarding the 
interference terms in an analog fashion [14, 15], we first quantize the interference and then retransmit 
the quantized version. By utilizing the imperfect current CSIT for precoding, the interference terms are 
efficiently compressed with quantization. 
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In the following two schemes, we demonstrate the vertices (pp, pp) an d (l,a) are all achievable. 
Note that we simply use hji(t) for the range space while hji(t) for the null space of hji(t). The precoder 
design to improve the achievable rate is out of scope of this paper. 

1) Achievability of (pp, pp)-' Inspired by the enhanced scheme for the two-user MISO broadcast 
channel [17], a 3-time-slotted protocol, which achieves the vertex (pp, pp) of the DoF region for the 
two-user MISO interference channel, is detailed as follows. 

Slot-1: In the first time slot, the symbol vectors u(l) and v(l) are respectively sent from the two 
transmitters with precoding, heading to their corresponding receivers: 

xi(l) = [h 21 (l) h£(l)]ti(l) (62a) 

x 2 (l) = [hn(l) h± 2 (l)]v(l) (62b) 

where u(l) = [«i(l) u 2 (l)] T , v(l) = [vi(l) v 2 (l)] T satisfy E(||w(l)|| 2 ) = E(||t>(l)|| 2 ) < P. The received 
signal at both receivers are then given as: 

y(l) = h" 11 (l)x 1 {\) + Vl + e(l) (63a) 

z(l) = h" 22 (l)x 2 (l) + m + b(l), (63b) 

where r\\ and rj 2 are interferences overheard at the Rx-1 and Rx-2 respectively, i.e., 

m = h" 12 (l)h 12 (l) Vl (l) + ^12(1)^12(1)^2(1) (64a) 
m = h" 21 (l)h 21 (l) Ul (l) + ^(1)^(1)^(1). (64b) 

According to (9), i.e., | 2 ) ~ P~ a , we can make E(|t/i [ 2 ) = E(\r] 2 \ 2 ) ~ P 1 "" by allocating 

E(|n!(l)| 2 ) = E(|vi(l)| 2 ) = P x ' a whereas E(|u 2 (l)| 2 ) = E(|v 2 (l)| 2 ) = P - P x ' a ~ P. 

At the end of slot-1, Tx-1 can reconstruct rj 2 = ^ 2 i(l) :E 2(l) while Tx-2 can reconstruct rji = 
^2i(l) a; i(l)- Instead of forwarding the interferences in an analog fashion, we first quantize the interfer- 
ence term rji into fji with (1 — a) log P bits each, then encode the index of rji to codeword q using a 
Gaussian channel codebook, and forward q as a common message to both receivers in the ensuing two 
time slots. To ease our presentation, the process of the encoding and decoding of q is omitted hereafter, 
making it look as if the codeword fji itself is conveyed to the receivers 2 . 

2 The simplification does not affect our results as long as we consider DoF region. For the rate region, the general scheme 
and more rigorous proof can be straightforwardly extended from [17]. 
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The source codebook Xi (resp. X2) is generated for 772 (resp. 771) and maintained at the Tx-1 (resp. Tx- 
2). The entry fji in codebook Xj satisfies 

r}i = fji + Aj (65) 

where Aj is the quantization error with distortion E( | Aj | 2 ) ~ c 2 t -D and independent of r/j. According to 
the rate distortion theory [22], we let the normalized distortion D decay as p~( l - a ) ( m turn E(|Aj| 2 ) ~ 
P°) so that each receiver can decode it successfully and the quantization error is drowned in the noise. 

Slot-2: During the second time slot, the index corresponding to 7)2 is encoded to c 2 and sent from 
Tx-1 as a common message together with a new symbol u(2) with ZF precoding, while a new symbol 
v(2) intended to Rx-2 is instantaneously sent from Tx-2 with ZF precoding as well. By omitting the 
encoding and decoding process of C2, the equivalent transmitted signals can be written as 

P a / 2 rj 2 





*i(2) 



+ /j, 2 L i(2)w(2) (66a) 



x 2 {2) = hi 2 (2)v(2). (66b) 

where the cordword 7)2 is power scaled with P a l 2 to ensure it can be recovered from noisy observation. 
To avoid interference from the other transmitters, we assume the new symbols u{2) and v{2) satisfy the 
power constraint E(|u(2)| 2 ) = E(|v(2)| 2 ) < P a . The received signals at both receivers are given as: 

y{2) = h* lhl (2)P a /% + /i H n (2)^(2X2) + h" 12 (2)hj; 2 (2)v(2) +e(2) (67a) 

„ ' v ' " v ' 

til * 12 * 13 

z(2) = h* 2hl (2)P a /% + h" 22 {2)hi 2 {2)v{2) + ^ 1 (2)/^ 2 L 1 (2)n(2) + 6(2). (67b) 

Note that in the received signal y{2), E(|tn| 2 ) ~ P, E(|ti 2 | 2 ) ~ P a , while IE(|t 13 | 2 ) - P° is at noise 
level. With distortion D ~ p-(i-a) ; both receivers can retrieve 772 with high probability by treating t 2 \ 
and t 22 as noise [17]. By removing fj 2 from the received signals, u(2) and v{2) can be recovered with 
high probability as long as their power constraints are satisfied. 

Slot-3: The transmission in the third time slot is similar to that in the slot-2, where the index 
corresponding to 771 chosen from X2 is encoded to c\ and transmitted as a common message together 
with another new symbol v(3) from Tx-2, while only one new symbol u(3) intended to Rx-1 is sent 
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from Tx-1. By omitting the encoding and decoding process of c\, the equivalent transmitted signals are 

xi(3) = fe^i(3)«(3) (68a) 



x 2 (3) 







+ hf 2 (3)v(3) 



where the new symbols u(3) and v(3) satisfy the power constraint E(|u(3)| 2 ) = 
received signals at both receivers are given as 

2/(3) = h* 12>1 (3)P a / 2 m + ^ n (3)^(3)n(3) + h» 12 (3)hi 2 {3)v{3) + e(3) 
z{3) = h* 22 , (3)1**/% + K 2 (3)hi 2 (3)v(3) + ^ 1 (3)h 2 L 1 (3)n(3) + 6(3) 



(68b) 



f(3)| 2 ) < P a . The 



(69a) 
(69b) 



2/(1) - *7i 




^i(l) 


SBl(l) + 


e(l) + Ai 








.^i(l). 




-A 2 



Similarly to the slot-2, r)i is retrievable at both receivers by treating other terms as noise, and it(3) and 
v(3) can be also recovered respectively by subtracting f)\ from the received signals at both receivers. 

At the end of the third slot, u(2), u(3), rji and rj 2 can be successfully recovered at the Rx-1. As was 
modeled in [16, 17], an equivalent MIMO can be formulated to find the symbols u(l): 

(70) 

[hkwj l - A2 J 

for the Rx-1, and it is similar for the Rx-2. 

Lemma 2. The vertex (^p, ^-jp) of DoF region is achievable by the above scheme. 

Proof: We outline the main idea of the proof here and please refer to Appendix for details. 

At the Rx-1 for instance, we transform the original signal model into an equivalent 2x2 point-to-point 
MIMO system model for u(l) (resp. v(l) at the Rx-2), together with two parallel SISO signal models 
respectively for u(2) and u(3) (resp. v{2) and v(3) at the Rx-2). For the MIMO model, we obtain the 
DoF of 2 - a, while get a DoF for each parallel SISO model, and finally 2 ~ a + 2a = DoF is achieved 
per user. ■ 

2) Achievability of (1, a): In the following, we extend the Han-Kobayashi scheme [1] here to achieve 
the vertex (1, a). 

The symbols sent from the Tx-1 consists of two parts u c and u p , where only u p is precoded by using 
imperfect current CSIT Simultaneously, one symbol v p intended to Rx-2 is sent from the Tx-2 with ZF 
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precoding. The transmission can be given as 

+ h 21 u p (71a) 

x 2 = h± 2 v p (71b) 

where the transmitted symbols are assumed to satisfy the power constraints E(|-u c | 2 ) < P, ¥,(\u p \ 2 ) = 
IE (bp | 2 ) < P a . Although the symbol u c is decodable by both receivers and hence referred to as a 
common message, it is only desirable by Rx-1. On the other hand, we refer to u p , v p as the private 
messages which can only be seen and decoded by their corresponding receivers. 
At the receiver side, we have 

V = h iiA u c + h^h^Up + h H 12 hi 2 v p +e (72a) 

N v ' N v ' 

z = h 21 x u c + h" 22 h\ 2 v p + h 21 h 21 u p +b, (72b) 

V v ' V v ' 

where the terms carrying common message are with approximated power P, while those carrying private 
messages are E(|t/h| 2 ) = E(|r/22| 2 ) ~ P a , and the interference terms E(|r/i2| 2 ) = E(|?72i| 2 ) ~ P° are at 
noise level according to (9). 

By firstly treating r]n,r]i 2 as noise, Rx-i can recover the common message u c with high probability. 
Then, the private messages u p and v p can be retrieved from the received signals after u c being subtracted 
at the Rx-1 and Rx-2, respectively. 

Lemma 3. The vertices (I, a) and (a, 1) of DoF region are achievable. 
Proof: Define 

V = Vn + V12 + e (73a) 
z = r] 2 2 + mi + b- (73b) 



Xi 
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For both receivers, the achievable rate can be given by 

I(u c , u p ; y\7) = I(u c ; y\7) + I(u p ; y\7, u c ) (74) 

= I(u c ;y\7) + I(u p ;y'\7) (75) 

= Elog (' + \JttS^ ff ) + Elog + Jfw) (76) 

= (l-a)logP + alogP + 0(l) (77) 

= logP + 0(l) (78) 



for the Rx-1, and 



I(v p ; z\7) = I(v p ; z\7, u c ) + I(v p ; u c \7) - I(v p ; u c \7, z) (79) 
= I(v p ;z\7,u c ) =I{v p -z'\7) (80) 

= Elogfl + 1 l !? 2 , l ' l9 l (81) 



M 2 + l&| 2 , 

= alogP + 0(l) (82) 

for the Rx-2, where (80) holds because u c and v p are independent. 

The DoF for both receivers can be simply obtained by definition. The other vertex (a, 1) can be 
achieved by swapping the roles of Tx-1 and Tx-2. This completes the proof. ■ 

Note that the vertices (1,0) and (0, 1) are achievable by letting one pair communicate while keeping 
the other one silent. In conclusion, all vertices of the DoF region for two-user MISO interference channel 
are achievable, and in turn the entire region can be achieved by time sharing. 

VI. Extension to MIMO Case 

Here, we extend the aforementioned MISO case to a class of MIMO settings with antenna configuration 
(M, M, N, N), where M antennas at each transmitter and N antennas at each receiver, satisfying M > 
2N. This includes a generalized MISO setting with more than 2 antennas at each transmitter. The discrete 
time baseband signal model is given by 

y(t) = H n (t) Xl (t) + H 12 (t)x 2 (t) + e(t) (83a) 

z(t) = H 21 {t) Xl (t) + H 22 (t)x 2 (t) + b(t), (83b) 

for any time instant t, where Hji(t) € C NxM is the channel matrix from Tx-i to Rx-j; e(t),b(t) ~ 
Xc (0, Ijv) are normalized AWGN vectors at the respective receivers; the coded input signal Xi(t) G C Mxl 
is subject to the power constraint E(||£Cj(t)|| 2 ) < P, Vi. 
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In analogy to the MISO case, we have the optimal DoF region of two-user time correlated (M, M, N, N) 
MIMO interference channel. 

Theorem 2. In the two-user (M, M, N, N) MIMO interference channel (M > 2N) with perfect delayed 
CSIT and imperfect current CSIT, the optimal DoF region can be characterized by 



di<N 

d 2 <N 
di + 2d 2 < N(2 + a) 
2di + d 2 < N(2 + a). 



(84a) 
(84b) 
(84c) 
(84d) 



Remark: The DoF region is irrelevant to the number of transmit antennas as long as M > 2N. For 
the M x 1 MISO case (M > 2), the DoF region is identical to that when M = 2, coinciding with the 
region for two-user MISO broadcast channel. 

Following the same strategy as the MISO case, we first provide the outer bound and then show the 
region confined by the outer bound is achievable. 



A. Outer Bound 

The outer bound can be simply extended from the MISO case. To avoid redundancy, we outline the 
main difference but omit the similar parts. By defining similarly y'(t), z'(t), U(t), i.e., 



we have 



Define 



y'(t)±H 12 (t)x 2 (t) + e(t) 
z'(t)±H 22 (t)x 2 (t) + b(t) 

u(t) 4 {{y\i)}lzlAz\i)}UAH(i)}UAH(i)}t =1 }, 



nRi < nN logP - ^ h(y' (t)\U(t) , H{t)) + n ■ 0(1) + ne r . 
t=i 

n 

nR 2 < J2h{y'(t),z'(t)\U{t),H(t))+ne n . 



t=i 



S(t) 



H 12 (t) 
H 22 (t) 



S(t) 



H 12 (t) 
H 22 (t) 



(85) 
(86) 
(87) 

(88) 
(89) 

(90) 
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and we upper-bound the weighted difference of two conditional differential entropies by 

h(y'(t), z'(t)\U(t), H{t)) - 2h(y'(t)\U(t), H{t)) (91) 
<E d(t) max max E s(t)|S(t) (logdet(I + S(t)K u (t)S H (t)) - 21ogdet(I + H 12 {t)K u {t)H" l2 {t))) (92) 

<E S(t) max max E s(t)|S(t) (logdet(I + H 22 (t)K u (t)H^ 2 (t)) - logdet(I + H l2 (t)K u {t)H H 12 {t))) (93) 

1 ' c ho, P (S(t)) y y ' 

< Nalog P + 0(1) (94) 

where K u (t) possesses the same definition as that in the MISO case and the last inequality is obtained 
according to the following Lemma. 

Lemma 4. For an N x M random matrix (M > N) H = H + H where II is independent of H and 
whose entries satisfy hij ~ 3Mc(0, a 2 ), given any K y with eigenvalues Ai > ■ ■ ■ > \m, we have the 
following upper and lower bounds: 

N 

E^Iog<fe/(I + HKH") <^log(l + \\H\\ 2 \) + 0{1) (95) 

i=i 

N 

Ejjlogdet(I + HKH") > ^ log(l + A^V) + O(l). (96) 

i=i 

The difference of the upper and lower bounds can be further bounded by 

log(l + ||H|| 2 Ai) - log(l + A^V) < - log(<r 2 ) + O(l) (97) 
where ( — jf YliLi ii>(N — i + 1) and ip(x) is the digamma function that given by [23, 24] 

x—l ^ 

rj)[x) = + < lux (98) 



/or integer x, where 7 zj Euler's constant. 



Proof: Please refer to Appendix for details. ■ 
Hence, we can outer-bound the weighted sum rate as 

n(Ri + 2R 2 ) < nN(2 + a) log P + n ■ O(l) + ne n (99) 
and similarly obtain another outer bound by exchanging the roles of Rx-1 and Rx-2, i.e., 

n(2R 1 + R 2 ) < nN(2 + a) log P + n ■ 0(1) + ne n (100) 
and therefore the outer bound of the DoF is obtained by the definition. 
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B. Achievability 

For the achievability, the vertices (N,Na), (Na,N), and ( N ( 2 + a ) ; N ( 2 + a ) ^ we a jj achievable and 
the achievable schemes can be simply extended from the MISO case. Here, we only show the achievable 
scheme for the vertex ^ N{2+a) ^ N(2+a) ^ ^ j nstance p or t ^ e achievability of (N, Na) and (Na, N), the 
extension is similar and straightforward. In the extended scheme, three time slots are consumed, where 
the transmitted signals are detailed as follows: 

Slot-1: The transmitted signals from both transmitters are given by 

= [q 2 i(i) Q^i(i)] u ( l ) ( 101a ) 

^2(1) = [Q 12 (l) Qi 2 (l)] v(l) (101b) 

where tt(l) G C 2Nxl , v(l) G C 2Nxl are assumed to satisfy Edl^l)!) 2 ) = E(||v i(l)|| 2 ) < P and 
Qji(t) G C MxN , Q±(t) G C MxN which satisfy 

Q 2l (t) C a{H 2 i(t)} Qi 2 (t) C a{ffi 2 (t)} (102a) 

Q^(t) C N{H 2 i(*)} Qi2(*) C N{ffi 2 (t)} (102b) 

where and represent range and null spaces, respectively. Note that the range space is with 
dimension N whereas the null space is with dimension M — N. Similarly to the MISO case, the estimation 
error satisfies E (ll-Hj-^Q^t)!! 2 ) ~ P' a . 
At both receivers, we have 

y(l)=H 11 (l)x l (l)+r ]l + e(l) (103a) 
z(l) = H 22 (l)aJ2(l) + »72 + b(l), (103b) 
where the interference vectors overheard at both receivers are 

m = H 12 (l)x 2 (l) eC Nxl (104a) 
= /fi 2 (l)Q 12 (lM(l) + H l2 (l)Qi 2 (l)v 2 {\) (104b) 
rj 2 = H 21 (l) Xl (l)eC Nxl (104c) 
= H 2 i(l)Q 2 i(l)ui(l) + i/ 21 (l)Q 2 L 1 (l)w 2 (l) (104d) 
where Wi(l), w 2 (l), v±(l) and v 2 (l) are all N x 1 vectors. By balancing the allocated power among 
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those vectors, i.e., 

E(|| Wl (l)|| 2 ) = P 1 -" E(|| U2 (1)|| 2 ) = P - P 1 "" (105) 

E(||^(l)|| 2 ) = P 1 "" E(||*, 2 (l)|| 2 ) = P - P 1 -" (106) 

we approximate the total power of interference vectors as E(||?7i|| 2 ) ~ pi-" anc j E(||^ 2 || 2 ) P 1_a . 
A set of source codebooks {%u, X 2 j, z = 1, - • • , N} with size (1 — a) log P bits each are generated to 
represent the quantized elements of the interference vectors r\i and r\\ at the Tx-1 and Tx-2, respectively 3 . 
The codewords representing the elements of r]2 and rji are chosen uniformly from {Xu} and {X 2 «} and 
concatenated as 172 and 771, respectively. As stated in MISO case, the indices of rji are encoded to c« 
using a Gaussian channel codebook and then forwarded as common messages to both receivers in the 
following two slots. For the sake of simplicity, we omit the channel encoding and decoding process of 
Ci, and therefore, it looks as if r)i itself is conveyed. 

Slot-2: The objective of the slot-2 is to convey the codeword vector 172 whose information is carried 
on a coded common message C2 together with a new symbol vector at the Tx-1, while only a new 
symbol vector is sent at the Tx-2. By omitting the encoding and decoding process of C2, the equivalent 
transmitted signals are 

xi(2) = P a/2 Q2i{2)m + Q 2 L i(2)w(2) (107a) 

x 2 (2) = Qi 2 (2)v(2) (107b) 

where u(2) G C Nxl and v{2) e C Nxl . We assume E(||w(2)|| 2 ) < P a and E(||^(2)|| 2 ) < P a to ensure 
they are recoverable. By treating u(2) and v{2) as noise, JV x 1 vector r) 2 is retrievable with high 
probability provided N linearly independent equations at both receivers. After that, u(2) and v{2) are 
also recoverable from N linear equations at the Rx-1 and Rx-2 by subtracting 1)2 from the received 
signals. 

Slot-3: The objective of the slot-3 is the same as the slot-2 but with the exchanged roles between Tx-1 
and Tx-2. The equivalent transmitted signals are given as 

xi(3) = Q^(3)«(3) (108a) 

a; 2 (3) = P a/2 Qi 2 (3)T7i + Qi>(3M3) (108b) 

where w(3) G C Nxl , v(3) G C Nxl are assume to satisfy power constraint E(||w(3)|| 2 ) < P a and 
E(||i)(3)|| 2 ) < P a . By firstly treating u(S) and v(3) as noise, N x 1 vector fji can be recovered with 

3 Here, the quantization is made on each element of the vector rji regardless of their mutual correlation. 
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high probability given N linearly independent equations at both receivers. Similarly to the slot-2, u(3) 
and v (3) can also be recovered from the subtracted received signals. 

At the end of slot-3, N x 1 vectors i)i and i) 2 can be all recovered at both receivers, serving to cancel 
the overheard interference as well as to provide additional linearly independent equations for v(l) and 
u(l), respectively. With 2N linearly independent equations, the 2N x 1 vectors u(l) and v(l) are both 
recoverable with high probability at its respective receiver. 



The proof of the achievable DoF pair y— ^ — -, — ( — — -J is similar to that in the MISO case. Take 
Tx-l/Rx-1 pair for example. The original channel model can be transformed to an equivalent 2N x 2N 
point-to-point MIMO channel which conveys symbol vector u(l) yielding N(2 — a) DoF, and two parallel 
N x N MIMO channels which carry u{2) and w(3) respectively yielding Na DoF each. Hence, the 
total N(2 + a) DoF is achieved within three time slots, and in turn the DoF pair ^ N ( 2 + a ) ; N ( 2 + a ) ^ j s 
achievable by symmetry. The detailed proof can be analogically derived according to the MISO case and 
hence omitted here. 



We characterize the DoF region of the two-user MISO and certain MIMO interference channels where 
the transmitter has access to both delayed CSI as well as an estimate of the current CSI. In particular, 
these results are suited to time-correlated fading channels for which a latency-prone feedback channel 
provided the transmitter with the delayed samples, based on which a prediction mechanism can be applied 
to obtain the current imperfect CSI. Our DoF region covers a family of CSIT settings, coinciding with 
previously reported results for extreme situations such as pure delayed CSIT and pure current CSIT. For 
intermediate regimes, the DoF achieving scheme relies on the forwarding to users of a suitably quantized 
version of prior interference obtained under imperfect linear ZF precoding at the two transmitters. 




VII. Conclusion 
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Appendix 

A. Proof of Lemma 2 

We consider Rx-1 for example to illustrate d\ = 2 4^- is achievable. By symmetry, d 2 at the Rx-2 can 
be similarly obtained. Given the received signal model 



y(l) = h H 11 (l) a;i (l) + r ?1 + e(l) 

y{2) = h* 11A (2)P a /% + h H 11 (2)^ 1 (2)u(2) + h" 12 (2)hi 2 (2)v(2) +e(2) 

v — : — „ > . ' <■ » ' 

til *12 *13 

2/(3) = ^,i(3)^ /2 ')i + /i H ii(3)^i(3)«(3)+/ l H 12 (3)^(3)t;(3) +e(3) 

> — : — „ ' v v ' v v ' 

Sn s 12 S13 

we have the achievable rate of Tx-1 and Rx-1 pair, i.e., 

I(u(l),u(2),u(S);{y(t)} 3 t=1 \7) 

= I(u(l); {y(t)} 3 t=1 \7) + I(u(2), u(3); {y(t)}t=i Hi), 7)- 
For the first term, by defining 

y'(l) = h* 12;1 (3)P a / 2 y(l)-y(3) 

= h* 12!l (3)P a / 2 h" u (l) Xl (l) + ^ 2;1 (3)P a / 2 (A! + e(l)) - sis - s 13 - e(3) 
y\2) = h* llA (2)P a / 2 h" 2l (l) Xl (l)) - h* lhl (2)P a / 2 A 2 + t 12 + t 13 + e(2). 

we have 

7(u(l); {y(i)}? =1 |T) = J(«(l); </(!), y'(2), 2/(3) IT) 
= /( W (l);y'(l), y '(2)|T) 



(109) 
(110) 

(111) 



(112) 
(113) 

(114) 
(115) 
(116) 

(117) 
(118) 



where y(3) is independent of u(l). 

By formulating an equivalent MIMO channel with Gaussian input u(l) and output {y'(l),y'(2)}, i.e., 



V(i)" 




^ 2>1 (3)fc H n (l) 


y(2)_ 




5l,l(2)^2l(l)_ 



[fe 21 (l) %!(!)]«(!) 



(119) 



ff 



+ 



^2,i(3)^ a/2 (Ai + e(l)) - si2 - sis - e(3) 
-h* u 1 (2)P a / 2 A 2 + t 12 + ii 3 + e(2) 



(120) 
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where E(|t 12 | 2 ) = E(|s 12 | 2 ) ~ P°, E(\t 13 \ 2 ) = E(\s 13 \ 2 ) ~ P°, E(|Aj| 2 ) ~ P°, and E(|e(t)| 2 ) ~ P°, 
we have 

I(u(l);y'(l),y'(2)\7)=I(P a / 2 Hu(l);y'(l),y'(2)\7) (121) 
= ^(P a / 2 ff W (l)|T) - h(P a / 2 Hu(l)\y'(l),y'(2),7) (122) 
= / l (P a / 2 J/ W (l)|T) - h(n\y'(l),y'(2),7) (123) 
> /i(P a / 2 iYw(l)|T) - h(n) (124) 
= Elogdet(P a W" 1 li"Kili" H ) (125) 
= (2 -a)logP + 0(l) (126) 

where (123) holds because translation does not change differential entropy; (124) follows the fact that 
conditioning reduces entropy; (125) is according to the definition of differential entropy with Gaussian 
distributed input, where K\ = E{w(l)« H (l)} and W = E{nn"}; and (126) is obtained because K\ ~ 
diagjP 1 -", P}, W ~ P a I and H does not scale as P. 
For the second term, we have 

I(u(2),u(3);{y(t)} 3 t=1 \u(l),7) (127) 

= /(«(2),«(3);y(2),y(3)|u(l),T) (128) 
= /(n(2), «(3); y(2), y(3)|u(l), ft , f) 2 , T) + J(u(2), «(3); 771, fc|tt(l), 7) 

- J(u(2), u(3); m , %|«(1), 2/(2), y(3), T) (129) 

= 7(n(2), «(3); y(2), y(3)|u(l), ft , r) 2 , T) (130) 

= I(u(2),u(3);y"(2),y"(3)\7) (131) 

where (128) holds because y(l) is independent of {u(2), «(3)}; (129) is obtained by the chain rule of 
mutual information; (130) is because u(2),u(3) are independent of 7)1, r) 2 ; (131) is due to 

y"(2) = y(2) - (2)P Q / 2 r} 2 = t 12 + t 13 + e(2) (132) 
y"(3) = y(3) - /»; 2>1 (3)P a / 2 r)! = s 12 + a 13 + e(3) (133) 

are independent of 771 , 772 and u(l). 

By formulating an equivalent parallel channel with Gaussian input u{2) (resp. u(3)) and output y'(2) 
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(resp. y" (?>)), we have 

J(n(2), n(3);/(2),y"(3)|T) (134) 
= /(«(2);y"(2)|T) + /(u(3);y"(3)|T) (135) 

= Elog ( x + ctw ) + Elog + cttct ) (136) 

= 2alogP + 0(l) (137) 

where (135) is due to the chain rule of mutual information and the independence between u{2) (resp. y"(2)) 
and u(3) (resp. y"(3)); (136) is according to the definition of differential entropy with Gaussian distributed 
input; the last equality is due to E(\t 12 \ 2 ) = E(|si 2 | 2 ) ~ P a , E(|ti 3 | 2 ) = E(|s 13 | 2 ) ~ P°, and 

E(|e(2)| 2 ) = E(|e(3)| 2 )~P . 

Substituting (126) and (137) into (113), by the definition of DoF 

_ i ]im i(«WMVMZYAy(t)}UV 

3 P^oo log P 

we conclude that d± = is achievable. By symmetry, di can be simultaneously obtained in a similar 
way. This completes the proof. 

B. Proof of Lemma 4 

We assume without loss of generality M > N. For M < N, the similar steps can be accordingly 
pursued. 
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First for the upper bound, we have 

EjjlogteL(I + HKH") =E^logdet(I + U h ^hV^KV h ^hU h h ) (139) 

= E^logdet(I + S| f V^lirV f/ ) (140) 

< E^logdet(I + X max ^ 2 H )V^KV H ) (141) 

N 

= ^E^log(l + X^HH^V^KVh)) (142) 



i=i 

N 



<J2 E h Ml + X mBX (HH")Xi) (143) 

i=l 
AT 

<5^E^log(l + ||ff||iAi) (144) 

z=l 
AT 

< ^E^log(l + (II^Hl + ||jar||g)Ai) (145) 
1=1 

AT 

<^og(l + (\\H\\ 2 F + MNa 2 )X t ) (146) 
i=l 

= f> g( i + \mtxi) + Eiog (i + t^St) 

i=l i=l V i +ll- H llF A i/ 

TV JV / 2 \ 

< £ log(l + lltfUlAO + £ lQ g 1 + j£^T d48) 
i=l i=l \ II Hf / 



N 



= J>g(l+ ||ff||?Ai) + 0(1) (149) 

i=i 

where in (139), // = U H ^ H V^ with S H e C WxJV and V/j G C MxW ; (140) comes from the equality 
det(I + AB) = det(l + BA); (143) is due to Poincare Separation Theorem [25,26] that \i(V£KV H ) < 
Xi(K) for i = 1, • • • ,N; (144) is from the fact that \\H\\ 2 < \\H\\ F ; (146) is obtained by applying 
Jensen's inequality to a concave function. 
For the lower bound, we have 

E#logdet(I + if.K'if H ) ^E^iVlog^l + de^i/Ki/") 1 /^ (150) 

= E^iVlog ^1 + exp lndet(H KH H )J ^ (151) 

> iVlog ^1 + exp ^E^lndet(J/ii:Ji" H )^ (152) 
where (150) comes from Minkowski's inequality and (152) is from Jensen's inequality by noticing that 
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log(l + e x ) is a convex function in x. Hence, the expectation part can be simplified further as 

Ejj\ndet(HKH") = E^ lndet (*A* H ) (153) 
>Ejjlndet(*'A'*' H ) (154) 

N 

= lnJjA i + E^lndet(* / * /H ) (155) 



i=i 

N 



> InJjAi + E^lndet(fl r V'V /H jy rH ) (156) 

i=i 

N 

= ln]^ + E^]ndet(ff'.H r/H ) (157) 

i=i 

N N 

= ln[|(A i a 2 ) + ^V(^-« + l) (158) 
i=i i=i 

where $ = i/V G (QWxM w j^jj y being the unitary matrix containing the eigenvectors of K, i.e., 
K = VAV" with A = diag(Ai, ■ ■ ■ , X M ); in (154), <£' = HV e C JVxJV with V being the first TV 
columns of V where A' = diag(Ai, • • • , Ajv) and *A$ H ^ <&'A'<I>' H ; (156) is due to the fact that the 
capacity of a Ricean fading channel is no less than that of a Rayleigh fading channel in all SNR region [27, 
28], by treating H as the deterministic line-of-sight component; (157) is because H is independent of V 
and H' is an N x N i.i.d. complex Gaussian matrix with each entry being zero-mean and variance-a 2 ; 
the last equation is according to the isotropic assumption and obtained from [23,24]. 
By the definition of (, we have 

Nlog ^1 + exp ^E^lndet(JTKfT)^ > N log ^1 + exp In [J(Ai<T 2 e c )^ (159) 

= Nlog(^+(j[(\ i a 2 e^ j (160) 

1 N + 
>AT-^[log(A i( 7V)] (161) 

i=i 

N 

> ^log(l + A^ 2 e^) - N (162) 



i=i 

N 



^log(l + A^V) + 0(1) (163) 



i=l 



where (x) + = max{x,0} and last inequality due to the fact (log(x)) + > log(l + x) — 1. 
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The difference between upper and lower bounds can be further bounded as 



log(l + \\H\\ 2 Xi) - log(l + \i<J 2 e<) < log 




(164) 



(165) 



< -log(<7 2 ) + log(||H|| 2 + <7 2 e c ) 
<-log(a 2 ) + log(||i7|| 2 + e^) 



(166) 



(167) 
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